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(54) Information record medium, apparatus for recording the same and apparatus for 
reproducing the same 



(57) An information record medium (1: DVD) has a 
record track (1a) to be reproduced by an information re- 
producing apparatus (S2). The information reproducing 
apparatus is provided with a read device (80), reproduc- 
es audio information while relatively moving the read de- 
vice along the record track recorded with at least the 
audio information by a predetermined unit of audio 
frame and by a unit of frame group (GOF) consisting of 



consecutive audio frames in a predetermined number. 
A plurality of audio packets (43, 202) are arranged along 
the record track, in each of which audio information piec- 
es (207) constructing the audio information sampled by 
a predetermined sampling frequency and audio control 
information (204, 205, 206) for controlling a reproduc- 
tion and an access of the audio information pieces are 
respectively recorded. 
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Description 

The present invention relates to an information record medium such as an optical disk of a high recording density 
type, which is capable of recording information such as video information, audio information and the like at a high 
5 density, and which is represented by a DVD (Digital Video or Versatile Disk). The present invention also relates to a 
recording apparatus for recording the information onto the information record medium, and a reproducing apparatus 
for reproducing the information from the information record medium. 

Conventionally, a so-called LD (Laser Disk) and a so-called CD (Compact Disk) are generalized as optical disks, 
on which information such as video information, audio information and the like is recorded. 
io On the LD, the CD or the like, the video information and the audio information are recorded together with time 

information indicating a time at which each information is to be reproduced with respect to a reproduction start position, 
which each LD or the like has, as a standard position. Thus, other than a general normal reproduction to reproduce 
the recorded information in the order of recording, various special reproductions are possible, such as a reproduction 
to extract and listen to an only desirable music out of a plurality of recorded musics, a reproduction to listen to the 
15 recorded musics in a random order and so on, in case of the CD, for example. 

The audio information or video information in the information recording apparatus and the information reproducing 
apparatus for the CD, the LD, etc., is treated by a unit of so called audio frame or video frame respectively at the time 
of recording, editing and reproducing it, and can be accessed by this unit of frame. 

However, there is a problem that, according to the above mentioned CD, LD or the like, a so-called interactive and 
20 variegated reproduction is not possible in which the audience can have a plurality of selection branches as for the 
video or audio information to be displayed or sound-outputted and in which the audience can select them to watch or 
listen to it. 

Namely, for example, in case of giving audience to a foreign movie on the LD, it is not possible to select one of 
languages to be used for a subtitle (caption) displayed on the picture plane (e. g., select one of the subtitle in Japanese 

25 and the subtitle in the original language) so as to display the subtitle in the selected language, or, in case of giving 
audience to a music recorded on the CD, it is not possible to select one of sound voices of the music (e. g., select one 
of the English lyric and the Japanese lyric). 

On the other hand, various proposals and developments are being made as for the DVD, as an optical disk in 
which the memory capacity is improved by about ten times without changing the size of the optical disk itself as com- 

30 pared with the aforementioned conventional CD. According to the knowledge of the present inventors, it is anticipated 
that it is possible, as for the DVD having such a large memory capacity, to: divide audio information, video information 
or the like by an appropriate length respectively into an audio pack, a video pack or the like; add additional information 
such as header information to each pack; switch and time-axis-multiplex these packs; and reproduced the multiplexed 
audio information, video information or the like. 

35 However, in order to access the audio information and the like recorded by using the complex technique of packing 

and multiplexing in this way, by the unit of frame that is a unit by which the audio information and the like are reproduced 
as mentioned above or by a group unit consisting of a plurality of the frames, it is anticipated that complex information 
processes are needed for the recording apparatus and/or the reproducing apparatus. Further, in the technical art of 
the DVD, the actuality is such that a person having an ordinary skill in this art does not even recognize the subject 

40 itself of accessing the audio information and the like recorded on the DVD in this way, by the unit of audio frame or by 
the group unit consisting of a plurality of the frames, which is conventionally used for the CD. 

It is therefore an object of the present invention to provide an information record medium, in which audio information 
to be recorded, edited and reproduced by the unit of audio frame is recorded in an appropriately divided and multiplexed 
form, and in which the audio information can be accessed by the unit of audio frame in a relatively simple and sure 

45 manner, an information recording apparatus for recording the information onto the information record medium and an 
information reproducing apparatus for reproducing the information from the information record medium. 

The above object of the present invention can be achieved by an information record medium having a record track 
to be reproduced by an information reproducing apparatus, which is provided with a read device, reproduces audio 
information while relatively moving the read device along the record track recorded with at least the audio information 

so by a predetermined unit of audio frame and by a unit of frame group consisting of consecutive audio frames in a 
predetermined number, and is able to access a particular audio frame in a particular frame group. In the information 
record medium, a plurality of audio packets are arranged along the record track, in each of which audio information 
pieces constructing the audio information sampled by a predetermined sampling frequency and audio control informa- 
tion for controlling a reproduction and an access of the audio information pieces by the information reproducing appa- 

55 ratus are respectively recorded. And that, the audio control information comprises frame boundary number information 
indicating the number of boundaries of audio frames existing in the audio packet including the audio control information, 
and frame number information indicating a serial frame number of the audio frame at which the boundary firstly appears 
in the audio packet including the audio control information. 
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According to the information record medium of the present invention, the audio packets are arranged along the 
record track. In each of the audio packets, the audio information pieces, which constructs the audio information sampled 
by the predetermined sampling frequency, and the audio control information for controlling the reproduction and the 
access of the audio information pieces are respectively recorded. And that, the audio control information comprises 

5 the frame the boundary number information, which indicates the number of boundaries of audio frames existing in the 
pertinent audio packet, and the frame number information, which indicates a serial frame number of the audio frame 
at which the boundary firstly appears in the pertinent audio packet. Asa result, in the information reproducing apparatus, 
by reading these frame boundary number information and frame number information for each audio packet, the position 
of the audio frame in each audio packet can be detected or determined by a simple algorithm, on the basis of these 

10 frame boundary number information and frame number information for each audio packet. Thus, it is possible to access 
the particular audio frame in the particular frame group (e. g. the GOF) in the relatively simple and sure manner. Further, 
even if a dropout (i.e. a partial deletion of the read information) occurs, the current position can be easily specified by 
the unit of audio frame, by referring to the frame boundary number information and the frame number information, 
which are stored in each audio packet. 

15 Accordingly, it is possible to realize an information record medium, such as a DVD or the like, on which the access 

can be easily performed. 

In one aspect of the information record medium of the present invention, the audio control information further 
comprises frame group number information indicating a serial group number of the frame group including the audio 
frame at which the boundary firstly appears. 

20 According to this aspect, in the information reproducing apparatus, by reading the frame group number information 

as well as the frame boundary number information and the frame number information for each audio packet, the position 
of the audio frame in each audio packet can be detected or determined by a simple algorithm, on the basis of these 
frame boundary number information, frame number information and the frame group number information for each audio 
packet. Thus, it is possible to access the particular audio frame in the particular frame group in the relatively simple 

2$ and sure manner. Further, even if the large dropout occurs such that it becomes unknown in which frame group or in 
which audio frame the current position is located, the current position can be easily specified by the unit of frame group 
and the unit of audio frame, by referring to the frame group number information, the frame boundary number information 
and the frame number information stored in each audio packet. 

Accordingly, it is possible to realize an information record medium, such as a DVD or the like, on which the access 

30 can be more easily performed. 

In another aspect of the information record medium of the present invention, the audio control information further 
comprises information indicating the sampling frequency, the number of quantized bits of the audio information and 
the number of channels of the audio information, respectively. 

According to this aspect, in the information reproducing apparatus, the sampling frequency, the number of quan- 

35 tized bits of the audio information and the number of channels of the audio information, which are necessary for the 
reproduction and the access, can be obtained easily by the reproduction. Thus, in case that a plurality of audio frames 
are included in one packet, the current position of the audio frame, which is not at the head in the packet, can be easily 
specified by performing a byte counting operation. Incidentally, these informations may be set in the information re- 
producing apparatus by means of a key input. 

40 Accordingly, it is possible to realize an information record medium, such as a DVD or the like, on which the access 

can be extremely easily performed. 

In another aspect of the information record medium, having the record track to be reproduced by the information 
reproducing apparatus, which reproduces video information by a predetermined video frame unit in addition to the 
audio information while relatively moving the read device along the record track recorded with at least the video infor- 

45 mation by the video frame unit in addition to the audio information. In this aspect, a plurality of video packets are 
multiplexed with the audio packets and arranged along the record track, in each of which video information pieces 
constructing the video information and video control information for controlling a reproduction of the video information 
pieces by the information reproducing apparatus are respectively recorded. 

According to this aspect, the video packets are multiplexed with the audio packets and arranged along the record 

50 track. In each of the video packets, the video information pieces constructing the video information and the video control 
information for controlling the reproduction of the video information pieces are respectively recorded. Thus, it is possible 
to record both of the audio information and the video information in timely balanced manner. 

Accordingly, if reproducing it by the information reproducing apparatus, it is possible to obtain both of the audio 
sound output and the video image output, which are timely balanced, from a single information record medium. 

55 The above object of the present invention can be also achieved by an information recording apparatus for recording 

information onto an information record medium having a record track to be reproduced by an information reproducing 
apparatus, which is provided with a read device, reproduces audio information while relatively moving the read device 
along the record track recorded with at least the audio information by a predetermined unit of audio frame and by a 
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unit of frame group consisting of consecutive audio frames in a predetermined number, and is able to access a particular 
audio frame in a particular frame group. The information recording apparatus is provided with: a record device for 
respectively recording, into a plurality of audio packets arranged along the record track, audio information pieces con- 
structing the audio information sampled by a predetermined sampling frequency and audio control information for 

5 controlling a reproduction and an access of the audio information pieces by the information reproducing apparatus, 
the audio control information comprising frame boundary number information indicating the number of boundaries of 
audio frames existing in the audio packet including the audio control information, and frame number information indi- 
cating a serial frame number of the audio frame at which the boundary firstly appears in the audio packet including the 
audio control information; and an input device for inputting at least one portion of the audio control information. 

w According to the information recording apparatus of the present invention, the audio information pieces constructing 

the audio information sampled by a predetermined sampling frequency are recorded into each of the audio packets, 
which are arranged along the record track, by the record device. As one portion of the audio control information for 
controlling the reproduction and the access of the audio information pieces is inputted by the input device, the audio 
control information, which comprises the frame boundary number information and the frame number information, are 

15 recorded into each of the audio packets, by the record device. Therefore, the above described information record 
medium of the present invention can be obtained. 

In one aspect of the information recording apparatus of the present invention, the record device records the audio 
control information, which further comprises frame group number information indicating a serial group number of the 
frame group including the audio frame at which the boundary firstly appears. 

20 According to this object, since the audio control information, which further comprises the frame group number 

information, is recorded by the record device, the above described one aspect of the information record medium can 
be obtained. 

The above object of the present invention can be also achieved an information reproducing apparatus for repro- 
ducing information from the above described information record medium of the present invention. The information 

25 reproducing apparatus is provided with: a read device for reading information recorded at a predetermined reading 
position on the record track; a movement device for relatively moving the read device along the record track or across 
the record track; an extract device for extracting the audio packet from the information read by the read device; an 
audio decode device for decoding the audio information included in the extracted audio packet in accordance with the 
audio control information; a specification device for specifying the particular audio frame in the particular frame group; 

30 and a control device for controlling the read device, the movement device and the extract device to access the specified 
particular audio frame in the particular frame group, on the basis of the frame boundary number information and the 
frame number information, which are included in the audio control information in the extracted audio packet. 

According to the information reproducing apparatus of the present invention, while the read device is relatively 
moved along the record track by the movement device, the information recorded at the predetermined reading position 

35 on the record track is read by the read device. Then, the audio packet is extracted from this read information by the 
extract device. Then, the audio information included in the extracted audio packet is decoded in accordance with the 
audio control information, by the audio decode device. Here, the particular audio frame in the particular frame group 
is specified as a target address, by the specification device. The specification here may be performed by directly 
inputting the frame group number and the audio frame number. Alternatively, the specification may be performed by 

40 specifying a time such as the hour, the minute, the second and so on, or by specifying all of these parameters. In 
summary, the frame group number and the audio frame number may be specified in any form, as long as these can 
be specified somehow. Then, under the control of the control device, the access to the specified particular audio frame 
in the particular frame group is performed on the basis of the frame boundary number information and the frame number 
information, which are included in the audio control information in the extracted audio packet, by the read device, the 

45 movement device and the extract device. Therefore, the position of the audio frame in each audio packet can be 
detected or determined by a simple algorithm, on the basis of these frame boundary number information and frame 
number information for each audio packet. Thus, it is possible to access the particular audio frame in the particular 
frame group in the relatively simple and sure manner. Further, even if the dropout occurs such that it becomes unknown 
in which audio frame the current position is located, the current position can be easily specified by the unit of audio 

50 frame, by referring to the frame boundary number information and the frame number information, which are stored in 
each audio packet. 

Accordingly, it is possible to perform the easy and sure access with respect to the above described information 
record medium of the present invention, and that the apparatus construction can be simplified and the production cost 
thereof can be reduced. 

55 in one aspect of the information reproducing apparatus of the present invention, the control device is provided with 

a counter device for counting information amount in the audio packet, and controls the record device the movement 
device and the extract device to successively approach the specified audio frame in the frame group while counting 
the information amount in the accessed audio packet by the counter device after accessing the audio packet including 
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the specified audio frame in the frame group, on the basis of the frame boundary number information and the frame 
number information. 

According to this aspect, under the control of the control device, the access to the audio packet, which includes 
the specified audio frame in the audio group, is firstly performed by the read device, the movement device and the 

5 extract device, on the basis of the frame boundary number information and the frame number information, which are 
included in the audio control information in the extracted audio packet. Then, under the control of the control device, 
the access is performed to successively approach the specified audio frame in the frame group while counting the 
information amount in the accessed audio packet by the counter device, by the read device, the movement device and 
the extract device. Therefore, it is enough for the counter device to perform the counting operation only within one 

10 audio packet. 

Accordingly, it is possible to perform the simple and sure access, by use of a simple hardware construction and a 
simple software construction, with respect to the above described information record medium of the present invention. 

In another aspect of the information reproducing apparatus of the present invention, the audio control information 
further comprises frame group number information indicating a serial group number of the frame group including the 
15 audio frame at which the boundary firstly appears. The control device controls the read device, the movement device 
and the extract device to access the specified audio frame in the audio group, further on the basis of the frame group 
number information. 

According to this aspect, under the control of the control device, the access to the specified audio frame in the 
frame group is performed on the basis of the frame group number information as well as the frame boundary number 

20 information and the frame number information included in the audio control information in the extracted audio packet, 
by the read device, the movement device and the extract device. Here, the position of the audio frame in each audio 
packet can be detected or determined by a simple algorithm, on the basis of these frame boundary number information, 
frame number information and frame group number information for each audio packet. Thus, it is possible to access 
the particular audio frame in the particular frame group in the relatively simple and sure manner. Further, even if the 

25 large dropout occurs such that it becomes unknown in which frame group or in which audio frame the current position 
is located, the current position can be easily specified by the unit of frame group and the unit of audio frame, by referring 
to the frame group number information, the frame boundary number information and the frame number information 
stored in each audio packet. 

Accordingly, it is possible to perform the simple and sure access with respect to the above described information 
30 record medium of the present invention, and that the apparatus construction can be simplified and the production cost 
thereof can be reduced. 

In this aspect, the control device may be provided with a counter device for counting information amount in the 
audio packet, and may control the record device, the movement device and the extract device to successively approach 
the specified audio frame in the frame group while counting the information amount in the accessed audio packet by 
35 the counter device after accessing the audio packet including the specified audio frame in the frame group, on the 
basis of the frame boundary number information, the frame number information and the frame group number informa- 
tion. 

According to this aspect, under the control of the control device, the access to the audio packet, which includes 
the specified audio frame in the audio group, is firstly performed by the read device, the movement device and the 
40 extract device, on the basis of the frame boundary number information, the frame number information and the frame 
group number information, which are included in the audio control information in the extracted audio packet. Then, 
under the control of the control device, the access is performed to successively approach the specified audio frame in 
the frame group while counting the information amount in the accessed audio packet by the counter device, by the 
read device, the movement device and the extract device. Therefore, it is enough for the counter device to perform 
45 the counting operation only within one audio packet. 

Accordingly, it is possible to perform the simple and sure access, by use of a simple hardware construction and a 
simple software construction, with respect to the above described information record medium of the present invention. 

The nature, utility, and further features of this invention will be more clearly apparent from the following detailed 
description with respect to preferred embodiments of the invention when read in conjunction with the accompanying 
50 drawings briefly described below. 

FIG. 1 is a diagram showing a physical structure of record information of a DVD as one embodiment of the present 
invention; 

FIG. 1 A is a perspective view of the DVD in FIG. 1 ; 
55 FIG. 2 is a diagram showing a logical structure of the record information of the DVD in FIG. t ; 

FIG. 3 is a diagram showing a structure of an interleaved unit of the DVD in FIG. 1 ; 
FIG. 4 is a table showing the specification of PCM linear audio data recorded on the DVD of FIG. 1 ; 
FIG. 5 is a diagram showing one example of the arrangement of sample data in the PCM linear audio data of FIG. 4; 
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FIG. 6 is a diagram showing another example of the arrangement of sample data in the PCM linear audio data of 
FIG. 4; 

FIG. 7 is a diagram showing another example of the arrangement of sample data in the PCM linear audio data of 
FIG. 4; 

s FIG. 8 is a diagram showing a physical data structure of the audio pack on the DVD of FIG. 1 ; 

FIG. 9 is a table showing the number of quantized bits, the number of samples and the data size of data in each 
packet for the specification of the PCM linear audio data of FIG. 4; 

FIG. 10 is a table showing a concrete data structure of the packet header of the audio pack of FIG. 8; 
FIG. 11 is a table indicating a size (the number of bytes) of an audio frame and the number of audio frames per 
10 packet for a specification of the PCM linear audio data in FIG.4; 

FIG. 1 2 is a conception view indicating a structure of the audio frames, integrated by a GOF unit, which is written 
to the audio pack of the DVD in FIG. 1 ; . 

FIG. 13 is a conception view indicating a relationship between the GOF and the audio pack, which are written to 
the audio pack of the DVD in FIG. 1 ; 
15 FIG. 14 is a conception view indicating a relationship between the audio frame and the audio pack, which are 

written to the audio pack of the DVD in FIG. 1 ; 

FIG. 15 is a table showing a concrete data structure in a private area in the packet of the audio pack of FIG. 8, in 
a first embodiment of the present invention; 

FIG. 16 is a table showing a concrete data structure in a private area in the packet of the audio pack of FIG. 8, in 
20 a second embodiment of the present invention; 

FIG. 17 is a block diagram of an information recording apparatus for recording the DVD in FIG. 1, as another 
embodiment of the present invention; 

FIG. 1 8 is a block diagram of an information reproducing apparatus for reproducing the DVD in FIG. 1 , as another 
embodiment of the present invention; 
25 FIG. 1 9 is a flow chart indicating one example of an access operation in the first embodiment of the reproducing 

apparatus of the present invention; 

FIG. 19A is a diagram showing various counters in a system controller of the first embodiment of the reproducing 
apparatus of the present invention; 

FIG. 20 is a conception view indicating a structure of audio frames integrated by the GOF unit for explaining the 
30 one example of the access operation in FIG. 19; 

FIG. 21 is a conception view indicating a structure of audio frames integrated by the GOF unit for explaining another 
example of the access operation in FIG. 19; 

FIG. 22 is a flow chart indicating one example of an access operation in the second embodiment of the reproducing 
apparatus of the present invention; 
35 FIG. 23 is a flow chart indicating one example of an access operation in a comparison aspect of the reproducing 

apparatus of the present invention; 

FIG. 24 is a conception view indicating a packet arrangement for explaining an operation of specifying a packet 
at a time of dropout in the first embodiment of the reproducing apparatus of the present invention; 
FIG. 25 is a conception view indicating a packet arrangement for explaining an operation of specifying a packet 
40 at a time of dropout in the second embodiment of the reproducing apparatus of the present invention; and 

FIG. 26 is a conception view indicating a packet arrangement for explaining an operation of specifying a packet 
at a time of dropout in the comparison aspect of the reproducing apparatus of the present invention. 

Referring to the accompanying drawings, embodiments of the present invention will be now explained. The fol- 
45 lowing explanations will be done for the embodiments, in which the present invention is applied to the aforementioned 
DVD. 

First of all, a physical structure and a logical structure as well as an operation of a DVD, as one embodiment of 
the information record medium to which the present invention is applied, will be explained with reference to FIGS. 1 
to 16. 

50 At first, a record format of video information and audio information on a record track of the DVD (i.e. a physical 

record format) is explained by use of FIG. 1. 

As shown in FIG. 1, a DVD 1 as the present embodiment is provided with a lead in area LI at its most inner 
circumferential portion and a lead out area LO at its most outer circumferential portion, between which video information 
and audio information are recorded along the record track such that they are divided into a plurality of VTSs 3, each 

55 of which has a unique ID (Identification) number (i.e. VTS#1 to VTS#n). Here, the VTS (Video Title Set) 3 is a set 
(bundle) of titles (each of which is one production or one work which an author or producer intends to offer to the 
audience), which are related to each other (e. g., which attribute, such as the number, the specification, the corre- 
sponding languages etc. of audio and video streams is the same to each other). More concretely, a plurality of movies 
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which are related to the same movie to each other but which languages of serifs (lines) are different from each other 
may be recorded as different titles respectively, or even in case of the same movies, the theater version and the special 
version may be recorded as different titles respectively. Ahead of the area where the VTSs 3 are recorded, a video 
manager 2 is recorded as shown in FIG. 1 . As the information recorded in the video manager 2, for example, information 
s related to the whole video and audio information recorded on the DVD 1 , such as a menu for accessing each title, 
information for preventing an illegal copy, an access table for directly accessing each title and so on, is recorded. 

These video, audio and control informations are recorded on a spiral or coaxial record track la of the DVD 1 as 
shown in FIG. 1 A. 

One VTS 3 is recorded such that it is divided into a plurality of VOBs 10, each of which has an ID number (VOB 
10 ID#1, VOBID#2, ...),andcontroldata11 disposed ahead of the VOBs 1 0. Here, a data portion constructed by a plurality 
of VOBs 10 is defined as a VOB set (VOBS) as shown in FIG. 1 . This VOB set is defined to distinguish the VOB 10, 
which constructs one portion of the VTS 3 as the substantia! portion of the video and audio information, from the control 
data 11, which constructs another portion of the VTS 3. 

In the control data 11 recorded at the head of the VTS 3, information such as PGCI (ProGram Chain Information), 
15 which is various information related to a program chain as a logical division obtained by combining a plurality of cells 
(the "cell" will be described later in detail), is recorded. In each VOB 10, the substantial portion of the video and audio 
information (i.e. the video and audio information itself other than control information) besides the control information 
are recorded. 

Further, one VOB 10 is constructed of a plurality of cells 20, each of which has an ID number (cell ID#1 , cell 
20 |D#2, ... ). Here, one VOB 10 is constructed such that it is completed by the plurality of cells 20 and that one cell 20 
does not strides over two VOBs 10. 

Nextly, one cell 20 is constructed of a plurality of VOB units (VOBUs) 30, each of which has an I D number (VOBU#1 , 
VOBU#2, ... ). Here, the VOB unit 30 is an information unit, each of which includes the video information, the audio 
information and sub picture information (which is defined as information of a sub picture such as a subtitle of a movie 
25 etc. ). 

One VOB unit 30 is provided with: a navi-pack (a navigation pack) 41 for storing various control data; a video pack 
42 for storing video data; an audio pack 43 for storing audio data; and a sub picture pack 44 for sub picture data. Here, 
in the video pack 42, a packet including the video data together with additional information such as header thereof is 
recorded. In the audio pack 43, a packet including the audio data together with additional information such as header 

30 thereof is recorded. Further, in the sub picture pack 44, a packet including graphics such as a character and a diagram 
as the sub picture, together with additional information such as header thereof, is recorded. In the video packs 42, 
which data amount is relatively large as shown in FIG. 1 , one or a plurality of GOPs are recorded within one VOB unit 
30. The audio pack 43 and the sub picture pack 44 are disposed intermittently between the video packs 42. 

It is prescribed by a standard specification of the DVD that there are 8 kinds of audio recordable on the DVD 1 

35 while 32 kinds of sub picture recordable on the DVD 1 . Further, there always exists the navi-pack 41 in one VOBU 30. 
On the other hand, there may not exist each of the video pack 42, the audio pack 43 and the sub picture pack 44 in 
one VOBU 30, or, even in case that these packs exist in one VOBU 30, the number of the packs and the order of the 
packs are freely determined. 

Finally, the navi-pack 41 is provided with: a DSI (Data Search Information) packet 51 including search information 

^0 to search a video image or an audio sound desired to be displayed or sound-outputted (concretely, search information 
such as an address, where the video or audio to be displayed or soundoutputted is recorded, on the DVD 1); and a 
PCI (Presentation Control Information) packet 50 including information related to a display control at a time of displaying 
the video image or outputting the audio sound, which is searched on the basis of the information of the DSI packet 51 . 
Further, the video data included in one VOBU 30 consist of at least one GOP (Group Of Pictures). In the PCI packet 

45 50, high light information, which defines a display or operation at a time when one selection item is selected out of 
selection items by the audience, is included. By the high light information, for example, the change of the picture plane 
display as well as the display position to be changed with respect to the selection item selected on a special picture 
plane of selection items (i.e. a so-called menu picture plane) for the audience to select, and the command corresponding 
to the selected item (i.e. a command to be performed in correspondence with the selected item) are set. 

50 The video information to construct and display a frame, a selection button and the like, which is required to construct 

and display the menu picture plane, is recorded in the sub picture pack 44 as the sub picture information. 

Further, the above described GOP is a minimum picture unit, which can be independently reproduced and which 
is determined by a standard based on the MPEG (Moving Picture Experts Group) 2 method. The MPEG 2 method is 
a picture compression method adopted at a time of recording the video information onto the DVD 1 in the present 

55 embodiment. 

Here, the outline of the MPEG 2 method is explained. Namely, in general, frame pictures forward and backward 
of one frame picture in continuous frame pictures are often resembled to each other and have mutual relationships. 
The MPEG 2 method is a method, which is proposed by paying an attention to this fact, and which generates one 
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frame picture existing between a plurality of frame pictures on the basis of the plurality of frame pictures transferred 
while they are timely separated by a few or several frames from each other, by means of an interpolating calculation 
based on moving vectors etc. of the original picture. In this case, if this one frame picture is to be recorded, it is enough 
to just record the information as for a differential vector and a moving vector thereof with respect to the plurality of 
5 frame pictures, so as to reproduce this one frame picture by estimating it from the plurality of frame pictures with 
referring to these vectors, at a time of reproduction. By this, the compression recording is enabled with respect to the 
picture. 

The MPEG 2 method used in the present embodiment employs a variable rate method, in which the data amount 
included in each GOP is not constant. 
10 in the above explained record format having a hierarchical structure as shown in FIG. 1 , each division can be freely 

set according to the author's intention, so as to perform recording on the basis of these set divisions. By performing 
the reproduction on the basis of a later described logical structure for each of these divisions, the variegated repro- 
duction can be performed. 

Nextly, a logical format (logical structure) constructed by combining the information recorded by the physical divi- 

15 sions shown in FIG. 1 is explained with reference to FIG. 2. The information is not actually recorded on the DVD 1 in 
the logical structure of FIG. 2. Instead, information (e.g. access information or time information) to reproduce each 
data shown in FIG. 1 by combining them (especially combining the cells 20) in the logical structure shown in FIG. 2, 
is recorded on the DVD 1 , especially in the control data 11 . 

To make the explanation clear, the following explanation is made from the lower hierarchical layer in FIG. 2. One 

20 program 60 is logically constructed on the basis of the author's intention by selecting and combining a plurality of cells 
20 among the physical structures explained by use of FIG. 1. The program 60 is also a minimum logical unit, which 
division can be identified by a system controller of a reproducing apparatus described later and which can be accessed 
by use of a command by the system controller. It is also possible for the author to define a gathering of one or more 
programs 60 as a minimum unit, which can be freely selected to be watched or listened to by the audience and which 

25 is referred to as a PTT (ParT of Title). 

Since one program 60 is logically constructed by selecting a plurality of cells 20, it is possible to use one cell 
commonly for a plurality of programs 60, namely to perform a so-called "alternative usage" of the cell 20 in which one 
cell 20 is reproduced in a plurality of different programs 60. 

Here, as for the number of each cell 20, at a time of treating the cell 20 on the physical format shown in FIG. 1 , 

30 the number is treated as the cell ID number (which is indicated by cell ID in FIG. 1). On the other hand, at a time of 
treating the cell 20 on the logical format shown in FIG. 2, the number is treated as the cell number in the order of 
description in the PGCI described later. 

Next, by combining a plurality of programs 60, one PGC (Program Chain) 61 is logically constructed on the basis 
of the author's intention. The aforementioned PGCI (ProGram Chain Information) is defined by a unit of the PGC 61 . 

35 The PGCI includes information indicating: the reproduction order of the cells 20 for each program 60 at a time of 
reproducing each program 60 (by this reproduction order, a unique program number (#1, #2, ... ) is assigned to each 
program 60); the reproduction order for each cell 20 (by this reproduction order, a unique cell number is assigned to 
each cell 20); an address which is a record position of each cell 20 on the DVD 1 ; the number of the cell 20 positioned 
at the head of one program 60 to be firstly reproduced; a reproduction method for each program 60 [it is possible for 

40 the author to select one reproduction method for each PGC 61 out of (i) a random reproduction (which is a random 
reproduction by use of random numbers, and the same program 60 may be reproduced by a plurality of times), (ii) a 
shuffle reproduction (which is a random reproduction by use of random numbers in the same manner as the random 
reproduction, but one program 60 is reproduced just once but not reproduced by a plurality of times), (iii) a loop repro- 
duction (which is a reproduction to reproduce one PGC 61 repeatedly), and (iv) a combination of the loop reproduction 

45 with the random reproduction or the shuffle reproduction, as a reproduction method to be employed at a time of repro- 
duction]; and various commands (e. g. commands able to be specified by the author for each PGC 61 or each cell 20). 
The recording position of the PGCI on the DVD 1 may be in the control data 11 as aforementioned, or in a control data 
(not illustrated) in the video manager 2 if the pertinent PGCI is related to the menu in the video manager 2 (refer to 
FIG. 1). 

50 in one PGC 61 , the substantial video and audio data etc. are included as a combination of the programs 60 (in 

other words, the combination of the cells 20) other than the above mentioned PGCI. 

Further, in one PGC 61, it is possible to perform the alternative usage of the cell 20 as explained before in the 
explanation for the program 60 (i. e. such a usage that the same cell 20 is commonly used by a plurality of different 
PGC 61 ). As the method of reproducing the cell 20 to be used, the author can select a method of reproducing the cells 

55 20 in an order regardless of the recording order on the DVD 1 (i. e. the method of reproducing the cells discontinuously 
arranged, for example, the method of reproducing the cell 20 prior which is recorded posterior on the record track) 
other than a method of reproducing the cell 20 in the recording order on the record track on the DVD 1 as it is (i. e. the 
method of reproducing the cells continuously arranged). 
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Then, one title 62 is logically constructed of one or a plurality of PGCs 61 (PGC #1 , PGC#2, ...) as shown in FIG. 
2. The title 62 is, for example, a unit corresponding to one movie, and is completed information which the author would 
like to offer to the audience of the DVD 1 . 

Finally, one VTS 63 is logically constructed of one or a plurality of titles 62 (title #1 , title #2, ...) as shown in FIG. 

5 2. The title 62 included in the VTS 63 has the attributes common to each other. For example, the movies based on one 
movie but in different languages correspond to the respective titles 62. The information included in one VTS 63 shown 
in FIG. 2 corresponds to information included in one VTS 3 shown in FIG. 1. Namely, all information included in the 
logical VTS 63 shown in FIG. 2 is recorded as one VTS 3 in the DVD 1 shown in FIG. 1. 

As the author specifies the information divided in the physical structure on the DVD 1 on the basis of the above 

10 described logical format, the video image (e.g. the movie image) for the audience to watch is to be formed. 

In the explanations for the physical structure shown in FIG. 1, for the easy understanding of the content, it has 
been explained such that a plurality of cells 20 are recorded in the order of the ID numbers. However, in the DVD 1 of 
the present embodiment, one cell 20 may be divided into a plurality of interleaved units I U to be actually recorded on 
the DVD 1, as shown in FIG. 3. 

'5 Namely, as shown in FIG. 3, it is assumed that the author constructs one PGC 61 A of the cells 20 having the ID 

numbers 1 , 2 and 4, and constructs another PGC 61 B of the cells 20 having the ID numbers 1 , 3 and 4. In this case, 
at a time of reproducing the information from the DVD 1 on the basis of the PGC 61 A, only the cells having the ID 
numbers 1 , 2 and 4 are reproduced, while, at a time of reproducing the information from the DVD 1 on the basis of the 
PGC 61 B, only the cells 20 having the ID numbers 1 , 3 and 4 are reproduced. In the case of the PGC 61 A for example, 

20 jf the cells 20 are recorded spaced from each other for each ID number, a certain time period to jump the optical pickup 
from the record position of the cell 20 having the ID number 2 on the DVD 1 to the record position of the cell 20 having 
the ID number 4 on the DVD 1 is required in the reproduction. This results in that the continuous reproduction (here- 
inafter, it is referred to as a "seamless reproduction") of the cell 20 having the ID number 2 and the cell 20 having the 
ID number 4 may not be possible depending on a capacity of a track buffer of the reproducing apparatus described later. 

25 Therefore, in the case shown in FIG. 3, the cell 20 having the ID number 2 and the cell having the ID number 3 

are divided into interleaved units III and are recorded by the interleaved units IU, each having a length, which does 
not destroy the continuity of an output signal of the track buffer even if an input signal to the track buffer is temporarily 
stopped, in correspondence with an input and output processing speeds at the track buffer (i.e. the interleaved units 
IU, each having a length which allows the track buffer to continuously output the output signal even if the input signal 

30 to the track buffer is ceased while the optical pickup jumps for the interval of one interleaved unit IU). For example, in 
case of reproduction based on the PGC 61 A, only the interleaved units IU constructing the cell 20 corresponding to 
the ID number 2 are continuously detected to be reproduced. In the same manner, in case of reproduction based on 
the PGC 61 B, only the interleaved units IU constructing the cell 20 corresponding to the ID number 3 are continuously 
detected to be reproduced. The length of the interleaved unit IU may be determined with considering the capability of 

35 a driving mechanism such as a slider motor to perform the track jump, in addition to the memory capacity of the track 
buffer. 

In this manner, by dividing one cell 20 into a plurality of interleaved units IU and recording them according to the 
author's intention, the signal outputted from the track buffer can be continuous even at a time of reproducing the PGC 
61 including the cells 20 having the discontinuous ID numbers, so that it is possible for the audience to watch continuous 

40 reproduction video image. 

Each interleaved unit IU is completed in one VOB 10, and does not stride over two adjacent VOBs 10. As for the 
relationship between the interleaved unit IU and the VOB unit 30, one or a plurality of VOB units 30 are included in 
one interleaved unit IU. One VOB unit 30 is completed in one interleaved unit IU, and is not divided into a plurality of 
interleaved units IU or does not strides over a plurality of interleaved units IU. 

^5 Nextly, among the video information and the audio information having the above mentioned physical structure and 

logical structure, the audio pack 43 especially related to the present invention is explained in detail with reference to 
FIGS. 4 and 16. 

The audio information in the DVD of the present embodiment consists of linear audio data based on the PCM 
(Pulse Code Modulation) method (hereinbelow, it is referred to as "PCM linear audio data") having the specification 
50 as listed below, for example. 

Sampling frequency j 48 kHz or 96 kHz 

the number of quantized bits j 16 bits, 20 bits or 24 bits 

channel number ! 1 ch to 8 ch 

55 I i 1 

As a combination of the parameters in this specification, for example, if the combination of 96 kHz, 24 bits and 8 
ch is considered, the data rate of the PCM linear audio data becomes 18. 432 MHz (Mega Hz). In order to perform the 



9 



EP 0 795 870 A2 



reproduction by this data rate, a significantly high operation rate or speed is required to the reproducing apparatus and 
the electric power consumption accompanying the rotation number of the disk is also significantly large. Therefore, in 
the DVD 1 of the present embodiment, with considering these operation rate and electric power consumption of the 
hardware of the reproducing apparatus, the upper limit of the data rate of the audio data is set to 6.144 MHz. Thus, in 

s the DVD 1 of the present embodiment explained herein below, the PCM linear audio data is dealt with, which specifi- 
cation is as shown in a table of FIG. 4. In the table of FIG. 4, the expression of "48k/98k" in a column of the sampling 
frequency indicates that either one of 48 kHz and 96 kHz is available (the same thing can be said for the rest), while 
the expression of "16, 20, 24" in the column of the number of quantized bits indicates that either one of 16 bits, 20 bits 
and 24 bits is available (the same thing can be said for the rest). 

10 Here, some examples of the arrangement of the PCM linear audio data are shown in FIGS. 5 to 7, respectively 

FIG. 5 shows the arrangement of sample data 301 a in the 1 6 bits mode, FIG. 6 shows the arrangement of sample 
data 301b in the 20 bits mode, and FIG. 7 shows the arrangement of sample data 301c in the 24 bits mode. 

In FIG. 5, sample data S2n and Sn+1 respectively indicate the sample data, which sample order is 2n, and the 
sample data, which sample order is 2n+1 (n = 0, 1 , 2 and so forth on). Namely, each of data portions A2n, B2n, ... 

15 constructing sample data 302a in the figure indicates 16 bits of respective one of the channels. 

In FIG. 6, sample data S2n and S2n+1 respectively indicate upper 16 bits of the sample data, which sample order 
is 2n, and upper 16 bits of the sample data, which sample order is 2n+1. Sample data e2n and e2n+1 respectively 
indicate lower 4 bits of the sample data, which sample order is 2n, and lower 4 bits of the sample data, which sample 
order is 2n+1. Namely, each of data portions A2n, B2n, ... constructing sample data 302b in the figure indicates the 

20 upper 1 6 bits of respective one of the channels, while each of data portions a2n, b2n, ... constructing sample data 303b 
in the figure indicates the lower 4 bits of respective one of the channels. 

In FIG. 7, sample data S2n and S2n+1 respectively indicate upper 1 6 bits of the sample data, which sample order 
is 2n, and upper 16 bits of the sample data, which sample order is 2n+1. Sample data e2n and e2n+1 respectively 
indicate lower 8 bits of the sample data, which sample order is 2n, and lower 8 bits of the sample data, which sample 

25 order is 2n+1. Namely, each of data portions A2n, B2n, ... constructing the sample data 302c in the figure indicates 
the upper 16 bits of respective one of the channels, while each of data portions a2n, b2n, ... constructing the sample 
data 303c in the figure indicates the lower 8 bits of respective one of the channels. 

Either one of the data 301a, the data 301b and the data 301c of respective modes shown in FIGS. 5 to FIG. 7 is 
treated as one unit for every 2 samples corresponding to the number of channels. 

30 The PCM linear audio data having the above explained specification is modified or arranged into the MPEG 2 

system stream in accordance with the ISO/IEC 1381 8-1 standard. Namely, according to this standard, the PCM linear 
audio data is divided into data pieces each having such a length able to be stored in one audio pack 43 shown in FIG. 
1 , and the divided data pieces are stored (recorded) in respective audio packs 43. 

In this case, one audio pack 43 has a size of not more than 2048 bytes, for example, as shown in FIG. 8, and has 

35 one packet in one pack. More concretely, in FIG. 8, the audio pack 43 has: a pack header 201 of 14 bytes, which 
indicates the attribute information as for the whole of the pertinent audio pack, e.g., information indicating that the PCM 
linear audio data in the pertinent audio pack is based on the MPEG 2 system stream standard; and a packet 202 of 
2034 bytes, in which the substantial PCM linear audio data is stored. 

Then, the packet 202 is provided with: a packet header 203 indicating the attribute information as for the whole of 

40 the pertinent packet; a sub stream ID; audio frame information 205; audio data information 206; and the PCM linear 
audio data 207 of 2013 bytes, which is the substantial audio data pieces. 

The maximum number of the audio samples, which can be stored in each audio pack 43 (packet 202) constructed 
in the above mentioned manner, is indicated in a table of FIG. 9, as for respective combinations shown in FIG. 4. As 
understood from the table of FIG. 9, since the number of the audio samples is treated in the unit of 2 samples (refer 

<5 to FIGS. 5 to 7), the maximum number of the samples able to be stored in one packet 202 is always even. 

Nextly, the data structure of the packet header 203 is explained in more detail with reference to FIG. 10. 
In FIG. 10, the packet header 203 is provided with: a packet start code field 203a of 3 bytes indicating the start 
position of the pertinent packet; a stream ID field 203b indicating that the audio data is based on the standard of the 
private stream 1 ; and a packet length field 203c indicating the length of the pertinent packet. 

50 The packet header 203 especially includes a PTS (Presentation Time Stamp i.e. time management information 

for the reproduction output) field 203f, to which the PTS indicating the time to manage the timing of the reproduction 
output set to synchronize each audio stream is written. This PTS field 203f is added to the packet header 203 only if 
the head of the audio frame is located in the pertinent packet 202. On the contrary, if the head of the audio frame is 
not located in the pertinent packet 202, the PTS field 203f is not added, and that the space for this PTS field 203f is 

55 packed by the subsequent data as forward packing. Therefore, the number of bytes of this packet header 203 is variable 
depending upon the existence and non-existence of the PTS 203f. Further, if there are two heads of the audio frames 
are located in one packet 202, only the PTS corresponding to the first audio frame among these is added to the packet 
header 203. The PTS indicates the time, at which the head byte of the audio frame appeared in the pertinent packet 
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is to be outputted from the decoder. The PTS is specified by a unit of 90 kHz and by a length of 33 bits. The reason 
for adopting 90 kHz here is that it is a value of the common multiple of the frequencies of audio frames such as the 
NTSC (National Television System Committee) method, the PAL (Phase Alternation Line) method and the like. The 
reason for adopting 33 bits here is to express the time in the range of 24 hours in one day by the measurement of the 
s 90 kHz clock. 

The packet header 203 is further provided with: a PTS and DTS flag field 203d, which indicates a PTS flag indicating 
whether or not the PTS is added in the pertinent packet header 203; a field 203e indicating the data length of the 
header; a field 203g indicating the buffer scale which is required for the reproduction; and a field 203h indicating the 
buffer size. 

io Finally, the packet header 203 is provided with a stuffing bytes field 203i of variable length from t to 7 bytes, such 

that the stuffing byte or bytes are stuffed (packed) so as to make the byte length of the pertinent audio pack 43 be 
equal to a predetermined byte length, which is not longer than 2048 bytes, as the occasion demands. Each stuffing 
byte has a special value such as B FFh B , for example, so that it is easy to cut reading this field 203i by recognizing the 
pattern of the stuffing byte. 

is in the present embodiment, the frequency of the audio frame is set especially to satisfy three conditions as listed 

below. 

Condition [C1] It is 1/n (n: natural number) of 48 kHz and 96 kHz, which are the sampling frequencies. 
Condition [C2] It is 1/n (n: natural number) of 90 kHz, which is the frequency used for specifying the PTS. 
Condition [C3] It is higher than 387 Hz, which is the highest frequency of the audio pack 43 under the specification 
20 shown in the table of FIG. 4. 

The above mentioned condition [C1] is a condition not to discontinue the audio frame while sampling. The above 
mentioned condition [C2] is a condition not to degrade the accuracy of the PTS. The above mentioned condition [C3] 
is a condition to add the PTS into the packet 202 as for all of the audio packs 43. The "highest frequency" mentioned 
in the condition [C3] is a frequency in case of the combination of 48 kHz, 8ch and 16 bits as the specification of the 
25 audio data, in which the maximum number of the samples in the packet is the minimum (1 24 samples) in the table of 
FIG. 9, and is obtained by 1/{124 X (1/48 kHz)} = 387 Hz. 

At first, from the conditions [C1] and [C2], the frequency of the audio frame must be 1/n (n: natural number) of 
6000, which is the greatest common divisor of 48000 and 90000. Then, with considering the condition [C3] in addition, 
the frequency of the audio frame to be obtained in this case is either one of 600 Hz, 750 Hz, 1200 Hz, 1500 Hz, 3000 
30 Hz and 6000 Hz. 

Here, since the number of bits of an audio frame counter required for the reproducing apparatus is increased in 
proportional to the frequency of the audio frame, the treatment of the audio data becomes more difficult or troublesome 
as the frequency gets higher. From this point of view, the desirable frequency among the above obtained audio fre- 
quencies is 600 Hz, which is the lowest frequency among those. At this time, the reproduction time period of one audio 
35 frame is 1/600 Hz = 1 .67 ms, so that a relatively simple reproduction is possible. 

From the above explanation, the frequency of the audio frame is assumed to be 600 Hz, in the present embodiment. 
A table of FIG. 11 shows a result when the number of the audio frames accommodated in one packet is obtained 
under this condition. Since the audio data is pauselessly written to the packet, the number of the audio frames accom- 
modated in one packet is variable on the basis of the size of the audio frame and the data size. The size (the number 
to of bytes) of one audio frame is variable on the basis of a sampling frequency, the number of quantized bits and the 
number of channels. 

From the table of FIG. 11 , it is understood that there are one to thirteen boundaries in the audio frames included 
in one packet. 

In the DVD of this embodiment having the above mentioned configuration, the accessing to the audio information 
45 to be recorded by the above mentioned unit of the audio frame is possible, so that the recording, editing, reproducing 
and other operations thereof by the unit of the audio frame are possible. At the same time, the accessing by the GOF 
(Group of Audio Frames) unit as one example of a unit of frame group, in which the audio frames are integrated by a 
larger unit, as shown later, is possible, so that the recording, editing, reproducing and other operations thereof by the 
GOF unit are possible. 

50 The configuration which enables the above mentioned access is explained hereafter in detail. 

At first, the configuration of the GOF is explained with reference to FIG. 12. 

In FIG. 12, the GOF is composed of a constant number of audio frames, for example, 20 frames. Information 
indicative of the boundary between the headers and the like is not added to each of the audio frames. That is, the 
sample data is arranged in a simply consecutive condition even at the boundary between the audio frames. Also, the 
55 information indicative of the boundary between the headers and the like is not added to each GOF. That is, the sample 
data is arranged in the simply consecutive condition even at the boundary between the GOFs. In this way, the audio 
frame or the GOF is a unit proportional to a time when the audio information recorded on the DVD is sound-outputted 
by the reproducing apparatus. Then, the audio frame boundary and the GOF boundary are always coincident with each 
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other. 

Next, a relationship between the GOF and the above mentioned navi-pack (navigation-pack) is explained with 
reference to FIG. 13. 

In FIG. 13, one navi-pack 41 is placed for each constant number of GOFs, for example, 15 GOFs. The navi-pack 

5 41 is placed at an interval of, for example, approximately 0.5 seconds. Consecutive 15 GOFs are divided and written 
to each of the audio packs 43, which are multiplexed with other video packs etc. on the recording track of the DVD 
(refer to FIG. 1 ). In this way, the navi-pack 41 is placed with respect to one of the GOF boundaries, and the audio frame 
included in one GOF does not stride over the navi-pack 41 . 

Next, a relationship between the audio pack and the audio frame is explained with reference to FIG. 14. 

10 In FIG. 14, the audio data is written to each of the audio packs 43 such that one to thirteen audio frame boundaries 

are included in one audio pack 43 as mentioned above. For example, some of the last portion of the audio data included 
in the audio frame #n-2 which strides over the boundary between the pertinent audio pack 43 and another audio pack 
43 one audio pack prior to the pertinent audio pack 43, the whole portion of the audio frame #n-1 , the whole portion 
of the audio frame #n, the whole portion of the audio frame #n+1 and some of the first portion of the audio data included 

15 in the audio frame #n+2 which strides over the boundary between the pertinent audio pack 43 and another audio pack 
43 one audio pack posterior to the pertinent audio pack 43 are written to each of the audio packs 43, following the 
header 200 (that is, the pack header 201 , the packet header 203, the sub-stream ID 204, the audio frame information 
205 and the audio data information 206 which are explained with reference to FIG. 8). 

However, since the GOF does not stride over the navi-pack 41 as shown in FIG. 1 3, in case of the audio pack 43a 

20 which firstly comes with respect to the navi-pack 41 , there is no existence of such audio data that is some of the last 
portion of the frame #n-2 in FIG. 14. Also, in case of an audio pack 43b which lastly comes with respect to the navi- 
pack 41 , there is no existence of such audio data that is some of the first portion of the frame #n+2 in FIG. 14. That is, 
in the audio pack 43b which lastly comes with respect to the navi-pack 41 , if the audio data included in one GOF or 
one audio frame can be partially written, this audio data is never written. Instead, this audio data is written from the 

25 beginning thereof, in an appropriate punctuation, in the audio pack 43a coming after the next navi-pack 41 . 

Next, a construction for accessing the particular audio frame in the particular GOF, with respect to the audio data 
written to the audio pack 43 having the above mentioned structure, is explained. 

In a first embodiment, there are described, in the private data area following the packet header 203 shown in FIG. 
8: a first access unit pointer, which indicates an address of a first audio frame boundary to enable an easy detection 

30 of the first audio frame to which the above mentioned PTS is applied (that is, the audio frame which firstly appears in 
each packet); information, which indicates an audio frame number of the first audio frame as one example of the frame 
number information (for example, any one of 0 to 19 which are consecutive numbers of 20 audio frames in each GOF 
shown in FIG. 12); and information, which indicates a total number of the audio frame boundaries (i.e. a total number 
of the frame headers) included in the pertinent packet as one example of the frame boundary number information. 

35 That is, the sub-stream ID 204, the audio frame information 205 and the audio data information 206, which are 

disposed in the private data area in the audio pack 43 in the first embodiment, are constructed as shown in FIG. 15. 

In FIG. 15, the private data area is provided with: the sub-stream ID 204 of 1 byte, which indicates that the audio 
data in the packet is the PCM linear audio data recorded under the standard of the private stream 1 ; the audio frame 
information 205 of 3 bytes, including the information of 8 bits, which indicates the total number of the audio frame 

40 boundaries in the pertinent packet, and the first access unit pointer of 16 bits as mentioned above; the audio data 
information 206 of 3 bytes indicating various parameters in relation to the linear audio, such as an audio emphasis 
flag, an audio mute flag, information of 5 bits representative of the frame number of the first audio frame in the GOF 
as mentioned above, the number of quantized bits, the sampling frequency, the number of channels, dynamic range 
control information and the like; and the PCM linear audio data 207 of 2013 bytes at the maximum, which is an actual 

45 audb information pieces. 

As mentioned above, the area where the linear audio data can be stored or recorded in one audio pack 43 has 
201 3 bytes of data at the maximum. 

As a result, according to the first embodiment, in the reproducing apparatus and/or the recording apparatus as 
described later, accessing the particular audio frame in the particular GOF is easily performed by referring to the frame 

50 number of the first audio frame described in the audio data information 206. Further, when accessing the particular 
audio frame in the particular GOF, even if a certain packet is dropped out (some of the information is omitted due to 
an error when reading the information from the DVD), since the frame number of the first audio frame is written for 
each packet, a current position can be specified easily and quickly without performing a complex calculation. 

In a second embodiment, the GOF number of the GOF, to which the first audio frame in the pertinent packet 

55 belongs, is especially described in the private data area following the packet header 203 shown in FIG. 8, as the frame 
group number information, in addition to the informations included in the case of the first embodiment. 

That is, the sub-stream ID 204, the audio frame information 205 and the audio data information 206, which are 
disposed in the private data area in the audio pack 43 in the second embodiment, are constructed as shown in FIG. 1 6. 
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In FIG. 16, a field 208 of 4 bits, where the GOF number of the GOF to which the first audio frame in the pertinent 
packet belongs (for example, any one of 0 to 1 4 that are the consecutive numbers of 1 5 GOFs between the navi-packs 
shown in FIG. 13) is described, is stored or recorded in the private data area, in addition to the information shown in 
FIG. 15. An audio data information 206a including this field 208 consists of 4 bytes of data. Accordingly, a PCM linear 
s audio data 207a has 201 2 bytes of data at the maximum. 

As a result, according to the second embodiment, in the reproducing apparatus and/or the recording apparatus 
as described later, accessing the particular audio frame in the particular GOF is easily performed by referring to the 
frame number of the first audio frame and the GOF number thereof. Further, when accessing the particular audio frame 
in the particular GOF, even if relatively large dropout occurs, since the frame number of the first audio frame and the 
10 GOF number thereof are written for each packet, the current position can be specified more easily and quickly without 
carrying out the complex calculation. 

Since the DVD has such a large memory capacity that, in addition to one movie, the audio voices and captions 
(titles) in a plurality of kinds of languages corresponding to this one movie can be recorded on a single optical disk, it 
is effective to apply the above described record format especially to the DVD 1 . 
is Next, an embodiment of recording apparatus for recording the above mentioned control information, video infor- 

mation and audio information onto the DVD 1 will be explained with reference to FIG. 17. 

At first, a construction and an operation of the recording apparatus as the embodiment is explained with reference 
to FIG. 17. 

As shown in FIG. 17, a recording apparatus S1 as the present embodiment is provided with: a VTR (video Tape 
20 Recorder) 70; a memory 71 ; a signal process unit 7 1 ; a hard disk (HD) device 73; a hard disk (H D) device 74; a controller 
75; a multiplexer 76; a modulator 77; and a mastering device 78. 
Nextly, an operation of the present embodiment is explained. 

Record information R, which is a raw material such as audio information, video information etc. to be recorded on 
the DVD 1 , is temporarily recorded in the VTR 70. Then, the record information R temporarily record in the VTR 70 is 

25 outputted to the signal process unit 72 by a request from the signal process unit 72. 

The signal process unit 72 applies an A/D (Analog to Digital) converting process and a signal compressing process 
to the record information R outputted from the VTR 70, and time-axis-multiplexes the audio information and the video 
information to output it as a compressed multiplexed signal Sr. After that, the compressed multiplexed signal Sr out- 
putted therefrom is temporarily stored into the hard disk device 73. 

30 Along with this, the memory 71 classifies the record information R into a plurality of partial record information Pr 

in advance, and temporarily stores content information related to the partial record information Pr which is inputted 
beforehand on the basis of a cue sheet ST, on which the user defined information such as the packet header, the sub 
stream ID, the audio frame information, the audio data information and so on shown in FIGS. 15 and 16, are written. 
Then, the memory 71 outputs it as a content information signal Si on the basis of a request from the signal process 

3S unit 72. 

Then, the signal process unit 72 generates and outputs a PCI information signal Spci and a DSI information signal 
Sdsi corresponding to the partial record information Pr with referring to a time code Tt, on the basis of the time code 
Tt corresponding to the record information R outputted from the VTR 70 and the content information signal Si outputted 
from the memory 71 . Then, the PCI information signal Spci and the DSI information signal Sdsi are temporarily stored 

40 in the hard disk device 74. 

The above described processes are performed with respect to the whole record information R. 
When the above described processes are finished as for the whole record information R, the controller 75 reads 
out the compressed multiplexed signal Sr from the hard disk device 73, reads out the PCI data signal Spci and the DSI 
data signal Sdsi as well as other control informations from the hard disk device 74, generates additional information 

45 DA, which includes independently each of the PCI data 50, the DSI data 51 and the other control informations, on the 
basis of these read out signals, and temporarily stores the additional information DA into the hard disk device 74. This 
is because there may be control information, which content is determined in dependence upon a generation result of 
the compressed multiplexed signal Sr among various control informations. 

On the other hand, the controller 75 performs a time management for each of the operations of the signal process 

50 unit 72, the hard disk device 73 and the hard disk device 74, and reads out the additional information DA, which includes 
the PCI information signal Spci and the DSI information signal Sdsi, from the hard disk device 74, so that the controller 
75 generates and outputs an additional information signal Sa corresponding to the read out additional information DA, 
and generates and outputs an information selection signal Sec to time-axis-multiplex the compressed multiplexed 
signal Sr and the additional information signal Sa. 

55 After that, the compressed multiplexed signal Sr and the additional information signal Sa are time-axis-multiplexed 

by the multiplexer 76 to be outputted as an information added compressed multiplexed Sap. If there exists the sub 
picture information to be recorded, it is inputted, by other means such as a hard disk device not illustrated, to the signal 
process unit 72, so that it is processed in the same manner as the video and audio information thereat. 
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Then, the modulator 77 adds an error correction code (ECC), such as a Reed Solomon code, and applies a mod- 
ulation such as an eight to sixteen (8-1 6) modulation with respect to the information added compressed multiplexed 
signal Sap outputted from the multiplexer 76, and generates and outputs a disk record signal Sm to the mastering 
device 78. 

s Finally, the mastering device 78 records the disk record signal Sm to a stamper disk, which becomes a master (i. 

e. a cutting dye) for the production of an optical disk. Then, by use of this stamper disk, an optical disk as a replica 
disk, which can be on sale in the general market, i.e. the DVD1 , can be produced by a replication device not illustrated. 

Nextly, the processes of recording the audio information and the video information into the audio pack and the 
video pack respectively related to the present embodiment are explained in more detail. 

10 In the above explained recording processes, the signal process unit 72 especially performs a process of dividing 

and compressing the video information, and stores (packs) the divided and compressed video information into the 
video pack 42 shown in FIG. 1 . The signal process unit 72 also performs a process of dividing the audio information 
and arranging the divided audio information to a predetermined sample data arrangement, and stores (packs) the 
divided and arranged audio information into the audio pack 43 shown in FIGS. 1 and 8. Then, the signal process unit 

is 72 time-axis-multiplexes these video pack 42 and the audio pack 44, and outputs the multiplexed packs as the com- 
pressed multiplexed signal Sr. 

At this time, the audio information to be stored or packet in the audio pack 43 is the PCM linear audio data shown 
in the table of FIG. 4, as mentioned above. The audio frame frequency thereof is 600 Hz, and the PTS is always added 
to each of the packet headers. The first access unit pointer, the information indicative of the audio frame number of 

20 the first audio frame and the information indicative of the total number of the audio frame boundaries included in the 
pertinent packet as shown in FIG. 15 are described in each of the packet in correspondence with the content of the 
cue sheet ST, or the information indicative of the GOF number in addition to these informations as shown in FIG. 16 
are described in each of the packet. 

By this, in the information reproducing apparatus for reproducing this DVD, accessing the particular audio frame 

25 jp the particular GOF is easily performed by referring to the frame number in the first audio frame, or by referring to 
the frame number of the first audio frame and the GOF number thereof, as described later. 

Next, an embodiment of reproducing apparatus for reproducing the information recorded on the DVD1 by the above 
mentioned recording apparatus S1 will be explained with reference to FIG. 18 to 26. 

At first, a construction and an operation of the reproducing apparatus as the embodiment is explained with reference 

30 to FIG. 18. 

As shown in FIG. 18, a reproducing apparatus S2 as the present embodiment is provided with: an optical pickup 
80; a demodulate and correct unit 81 ; stream switches 82 and 84; a track buffer 83; a system buffer 85; a demultiplexer 
86; a VBV (Video Buffer Verifier) buffer 87; a video decoder 88; a sub picture buffer 89; a sub picture decoder 90; a 
mixer 91 ; an audio buffer 92; an audio decoder 93; a PCI (Presentation Control Information) buffer 94; a PCI decoder 

35 95; a high light buffer 96; a high light decoder 97; an input unit 98; a display unit 99; a system controller 100; a drive 
controller 101; a spindle motor 102; and a slider motor 103. The construction shown in FIG. 18 only illustrates the 
portions related to the video and audio reproduction of the reproducing apparatus S2. The description and the detailed 
explanation as for servo circuits to servo-control the optical pickup 80, the spindle motor 102, the slider motor 103 and 
the like are omitted since they are constructed in the same manner as the conventional arts. 

40 Nextly, an overall operation of the present embodiment is explained. 

The optical pickup 80 includes a laser diode, a polarization beam splitter, an objective lens, a photo-detector and 
the like not illustrated, and irradiates a light beam B as a reproduction light with respect to the DVD 1. The optical 
pickup 80 receives a reflection light of the light beam B from the DVD 1 , and outputs a detection signal Sp corresponding 
to information pits formed on the DVD 1 . At this time, the tracking servo control and the focus servo control are operated 

45 with respect to the objective lens etc. of the optical pickup 80 in the same manner as the conventional art so that the 
light beam B can be irradiated precisely onto the information track of the DVD 1 and that the light beam B can be 
focused on the information record surface of the DVD 1 . 

The detection signal Sp outputted from the optical pickup 80 is inputted to the demodulate and correct unit 81, 
where a signal demodulation process and an error correct process are applied to it to generate a demodulation signal 

50 Sdm, which is outputted to the stream switch 82 and the system buffer 85. 

The opening and closing operation of the stream switch 82, to which the demodulation signal Sdm is inputted, is 
controlled by a switch signal Ssw1 from the drive controller 101 . When it is closed, the stream switch 82 passes there- 
through the inputted demodulation signal Sdm as it is to the track buffer 83. When it is opened, the demodulation signal 
Sdm is not outputted therethrough, so that unnecessary or useless information (signal) is not inputted to the track 

55 buffer 83. 

The track buffer 83, to which the demodulation signal Sdm is inputted, consists of a FIFO (First In First Out) memory, 
for example. The track buffer 83 temporarily stores the inputted demodulation signal Sdm, and continuously outputs 
the stored demodulation signal Sdm when the stream switch 84 is closed. The track buffer 83 compensates a difference 
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or fluctuation in the data amount between respective GOP under the MPEG 2 method, and continuously outputs the 
demodulation signal Sdm, which is discontinuously inputted due to a track jump in the aforementioned seamless re- 
production, in case of reading the data divided into the interleaved units it, so as to avoid the interruption of the repro- 
duction due to the discontinuity. 

5 The opening and closing operation of the stream switch 84, to which the demodulation signal Sdm is continuously 

inputted, is controlled by a switch signal Ssw2 from the system controller 1 00 such that the various buffers at its posterior 
stage may not be over-flown or, on the contrary, may not become empty to stop the decoding process, in the separating 
process by the demultiplexer 86. 

On the other hand, the system buffer 85, to which the demodulation signal Sdm is inputted in parallel with the track 

10 buffer 83, accumulates the management information (e. g. the video manager 2), the control data 11 of the VTS 3 and 
the like (refer to FIG. 1) which are detected firstly upon loading the DVD 1 and which are related to the whole information 
recorded on the DVD 1 . Then, the system buffer 85 outputs the accumulated data as a control information Sc to the 
system controller 1 00, and temporarily stores the DSI packet 51 for each navi-pack 41 (refer to FIG. 1 ) as the occasion 
demands while reproducing the information, to output it also as the control information Sc. 

is The demultiplexer 86, to which the demodulation signal Sdm is continuously inputted through the stream switch 

84, separates the video information, the audio information, the sub picture information and the PCI packet 50 for each 
navi-pack 41 respectively from the inputted demodulation signal Sdm, and outputs them as a video signal Sv, a sub 
picture signal Ssp, an audio signal Sad and a PCI signal Spc respectively to the VBV buffer 87, the sub picture buffer 
89, the audio buffer 92 and the PCI buffer 94. The demultiplexer 86 sends the packet header etc. of each packet as 

20 the control signal Sdmx to the system controller 100. There may be a case where, in the demodulation signal Sdm, 
different streams of the audio information or the sub picture information in a plurality of different languages are included 
as the audio or sub picture information. In that case, a desirable language is selected for the audio or sub picture 
information by a stream selection signal Sic from the system controller 1 00, so that the audio or sub picture information 
in the desirable language is outputted to the audio buffer 92 or the sub picture buffer 89. 

25 The VBV buffer 87, to which the video signal Sv is inputted, consists of a FIFO memory, for example. The VBV 

buffer 87 temporarily stores the video signal Sv and outputs it to the video decoder 88. The VBV buffer 87 compensates 
the difference or fluctuation in the data amount between respective pictures of the video signal Sv compressed by the 
MPEG 2 method. Then, the video signal Sv in which the differences in the data amount are compensated, is outputted 
to the video decoder 88, and is decoded by the MPEG 2 method to be outputted as a decoded video signal Svd to the 

30 mixer 91 . 

On the other hand, the sub picture buffer 89, to which the sub picture signal Ssp is inputted, temporarily stores 
the inputted sub picture signal Ssp, and outputs it to the sub picture decoder 90. The sub picture buffer 89 is to syn- 
chronize the sub picture information included in the sub picture signal Ssp with the video information corresponding 
to the sub picture information, and to output it. Then, the sub picture signal Ssp synchronized with the video information 
35 is inputted to the sub picture decoder 90 and is decoded to be outputted as a decoded sub picture signal Sspd to the 
mixer 91 . 

In a case where the sub picture signal Ssp includes video information to construct a frame, a selection button etc. 
for displaying the menu picture plane, the sub picture decoder 90 changes a display condition of the selection button 
etc. to be displayed, in the sub picture signal Sspd on the basis of a high light control information Sch from the system 

40 controller 1 00 to output it. 

The decoded video signal Svd outputted from the video decoder 88 and the decoded sub picture signal Sspd 
outputted from the sub picture decoder 90 (which is in synchronization with the corresponding decoded video signal 
Svd) are mixed together by the mixer 91 , and are outputted as a final video signal Svp to be displayed to a display 
device such as a CRT (Cathode Ray Tube) device not illustrated. 

45 The audio buffer 92, to which the audio signal Sad is inputted, consists of a FIFO memory, for example. The audio 

buffer 92 temporarily stores the audio signal Sad and outputs it to the audio decoder 93. The audio buffer 92 is to 
synchronize the audio signal Sad with the video signal Sv or the sub picture signal Ssp including the corresponding 
video information, and delays the audio signal Sad in accordance with the output condition of the corresponding video 
information. Then, the audio signal Sad, which is time-adjusted to synchronize with the corresponding video information, 

50 is outputted to the audio decoder 93. Then, a predetermined decoding process is applied thereat to the audio signal 
Sad, and it is outputted as a decoded audio signal Sadd to a speaker etc. not illustrated. If it is detected by the system 
controller 100 that it is necessary to temporarily stop (pause) the audio voice in the reproduction immediately after 
accessing, a pause signal Sea is outputted from the system controller 100 to the audio decoder 93, so that the output 
of the decoded audio signal Sadd is stopped temporarily at the audio decoder 93. 

55 The PCI buffer 94, to which the PCI signal Spc is inputted, consists of a FIFO memory, for example. The PCI buffer 

94 temporarily stores the inputted PCI signal Spc and outputs it to the PCI decoder 95. The PCI buffer 94 is to syn- 
chronize the PCI packet 50, which is included in the PCI signal Spc, with the video information, the audio information 
and the sub picture information corresponding to the PC I packet 50, and apply the PCI packet 50 to the video information 
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and the like. Then, from the PCI signal Spc, which is synchronized with the corresponding the video information and 
the like by the PCI buffer 94, a high light information included in the PCI packet 50 is separated or extracted by the 
PCI decoder 95, and is outputted as a high light signal Shi to the high light buffer 96. The portion of the PCI packet 50 
other than the high light information is outputted as a PCI information signal Spci to the system controller 100. 

5 The high light buffer 96, to which the high light signal Shi is inputted, consists of a FIFO memory, for example. The 

high light buffer 96 temporarily stores the inputted high light signal Shi and outputs it to the high light decoder 97. The 
high light buffer 96 is to time-compensate the high light signal Shi so as to precisely perform a change in the display 
condition of the selection item, which corresponds to the high light information, in correspondence with the sub picture 
signal Ssp which includes the video information for the high light information. Then, the time -compensated high light 

10 signal Shi is decoded by the high light decoder 97, and the information included in the high light signal Shi is outputted 
as a decoded high light signal Shid to the system controller 100. Here, the system controller 100 outputs the afore- 
mentioned high light control signal Sch to change the display condition by the high light information, on the basis of 
the decoded high light signal Shid. 

On the basis of the control information Sc inputted from the system buffer 85, the control signal Sdmx inputted 

15 from the demultiplexer 86, the PCI information signal Spci inputted from the PCI decoder 95 and an input signal Sin 
inputted from the input unit 98 such as a remote controller, the system controller 1 00 outputs the aforementioned switch 
signal Ssw2, the language selection signal Sic, the pause signal Sea and the high light control signal Sch to correctly 
perform the reproduction corresponding to those input signals, and also outputs a display signal Sdp to display an 
operation condition etc. of the reproducing apparatus S2 to the display unit 99 such as the liquid crystal device. 

20 Further, the system controller 1 00 outputs a seamless control signal Scsl corresponding to the track jump process 

such as a search process etc., to the drive controller 1 01 , when it detects by the control signal Sc or the aforementioned 
DSI data etc. that it is necessary to perform the track jump process such as a search in order to perform the seamless 
reproduction. 

Then, the drive controller 101 , to which the seamless control signal Scsl is inputted, outputs a drive signal Sd to 

25 the spindle motor 102 or the slider motor 103. By this drive signal Sd, the spindle motor 102 or the slider motor 103 
moves the optical pickup 80 such that the record position to be reproduced on the DVD 1 is irradiated with the light 
beam B (refer to an arrow of a broken line in FIG. 18), and the spindle motor 102 CLV-controls (Constant Linear Velocity- 
controls) the rotation number of the DVD 1 . Along with this, the drive controller 101 outputs the aforementioned switch 
signal Ssw1 on the basis of the seamless control signal Scsl, so as to open the stream switch 82 when the demodulation 

30 signal Sdm is not outputted from the demodulate and correct unit 81 while the optical pickup 80 is being moved, and 
so as to close the stream switch 82 when the demodulation signal Sdm is started to be outputted, so that the demod- 
ulation signal Sdm is outputted to the track buffer 83. 

Next, the construction and the operation of the first embodiment of a reproducing apparatus S2 in accordance with 
the present invention are explained with reference to FIG. 19. The first embodiment of this reproducing apparatus S2 

35 is constructed to perform an appropriate access operation for the first embodiment of the DVD 1 constructed to have 
the audio frame information and the audio data information as shown in FIG. 15. 

In the present embodiment, in FIG. 18, the PCM linear audio data shown in FIG. 4 stored in the audio pack 43 is 
outputted to the audio buffer 92, as the audio signal Sad, by the demultiplexer 86. Along with this, since the PTS is 
always added to each packet, the informations respectively indicating the first access unit pointer, the frame number 

40 and the total number of the frame boundaries which are shown in FIG. 1 5, are outputted, as a portion of the control 
signal Sdmx, for each packet, from the demultiplexer 86. These informations are adapted to be inputted to the system 
controller 100. Alternatively, these informations may be inputted through the audio buffer 92 to the system controller 
'100, and not as the portion of the control signal Sdmx. 

The system controller 100 is constructed to perform the access operation shown in a flow chart of FIG. 19, on the 

45 basis of these inputted informations. 

More concretely, as shown in FIG. 19A, the system controller 100 is provided with: a counter 100a for counting a 
value gn of the GOF number; a counter 100b for counting a value pafn of the audio frame number, a byte counter 100c 
for counting a value be of the number of bytes; and a frame counter 100d for counting a value afc of the number of 
frames, in the present embodiment, as described below in more detail. 

50 a control operation performed by the system controller 100 is explained hereafter. Here, a case of accessing a 

fourteenth audio frame in a third GOF of a certain navi-pack is. explained as an example. 

In FIG. 19, when a time specification (a o'clock : b minute : c second : d GOF : e audio frame) is firstly specified 
by a user, the optical pickup is moved toward a navi-pack including the access target, and the input of the navi-pack 
(NAV_Pack) is waited for by the system controller 100 (Step S11). Since data corresponding to a time code is also 

55 stored in the navi-pack, the access by the unit of navi-pack can be easily carried out by referring to this data. When 
the input of the navi-pack is detected (Step S11 ; YES), at first, a value gn of a counter having the GOF number and 
a value pafn of a counter having the previous audio frame number are respectively reset to [0] (Step S12). Then, the 
reading process is continued, and the input of the audio pack in the packs multiplexed in the VOBU, to which the navi- 
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pack is added, is waited for by the system controller 100 (Step S13). When the audio pack is inputted (Step S13; 

YES), an audio frame number afn (any one of 0 to 1 9) obtained by reading from the audio data information (refer to 

FIG. 1 5) in the audio pack is compared with the value pafn of the audio frame number stored in the counter (Step S14). 

Here, if pafn > afn (Step S14 ; YES), the operation proceeds to a step 15, where the gn is incremented by one (Step 
s S15). The value of the counter, as the number of the previous audio frame, is set to pafn = afn (Step S16). On the 

other hand, if it is not pafn > afn at the step S14 (Step S14 ; NO), the operation does not pass through the step S15, 

and the process at the step S16 is performed. 

Then, it is judged whether or not gn = 1 (Step S17). If gn = 1 (Step S17 ; YES), it is judged whether or not (afn + 

naf - 20) ^ 14 (Step S18). Here, the naf is the total number (any one of 1 to 1 3) of the audio frame boundaries existing 
10 in the pertinent pack, and is obtained by reading from the audio frame information (refer to FIG. 15) in the pertinent 

audio pack. If it is not (afn + naf - 20) ^ 14 at the step S18 (Step S18 ; NO), it is further judged whether or not gn = 2 

(Step S1 9). On the other hand, if it is not gn = 1 at the step S17 (Step S17 ; NO), the operation does not pass through 

the step S18, and the process at the step S19 is performed. 

Next, if it is not gn = 2 at the step S19 (Step S19 ; NO), the operation returns back to the step S13, and the 
15 processes on and after that step are repeated. On the other hand, if gn = 2 at the step S19 (Step S19 ; YES), it is 

judged whether or not (afn + naf) ^ 14 (Step S20). Here, if it is not (afn + naf) ^ 14 (Step S20 ; NO) the operation 

returns back to the step S13, and the processes on and after that step are repeated. 

If (afn + naf - 20) ^ 14 at the step S18 (Step S18 ; YES), this corresponds to a case where both of the boundary 

of the head of the GOF (#3) of the access target and the audio frame (#14) of the access target are located in the 
20 audio pack at a current reading position, as shown in FIG. 20. Thus, the operation proceeds to a step S21 . Then, a 

count value n for judging an access completion is set to n = 20 + 13 - afn so as to indicate "the total number of audio 

frames in one GOF (20)" + the number of audio frames to the audio frame immediately before the access target in 

the GOF (13)" - "the audio frame number of the audio frame at which the boundary firstly appears in the audio pack 

(afn)" (Step S21 ). After that, the operation proceeds to a step S23. 
25 On the other hand, if afn + naf ^ 14 at the step S20 (Step S20 ; YES), this corresponds to a case where the 

boundary of the head of the GOF (#3) of the access target is located in the audio pack previous to the currently read 

audio pack, this GOF (#3) is included from its half way in the currently read audio pack, and the audio frame (#14) of 

the access target is located in the currently read audio pack, as shown in FIG. 21. Thus, the operation proceeds to a 

step S22. Then, the count value n for judging the access completion is set to n = 1 3 - afn (Step S22) so as to indicate 
30 "the number of audio frames to the audio frame immediately before the access target in the GOF (13)" - "the audio 

frame number of the audiof rame at which the boundary firstly appears in the audio pack (afn) n . After that, the operation 

proceeds to the step S23. 

At the step S23, the number of bytes per the audio frame is calculated by a calculation equation : bpaf = (fs xqxc) 
/ (8 x 600) (Step S23), wherein fs is the sampling frequency, q is the number of quantized bits and c is the number of 

35 channels, all of which are obtained by reading from the audio data information in the audio pack. Then, a value be of 
a byte counter and a value afc of a frame counter are reset to [0] (Step S24). 

Next, an address indicated by the first access unit pointer obtained by reading from the audio frame information 
in the audio pack is searched (Step S25). As a result, when the reading position coincides with the address of the 
frame boundary of the audio frame that firstly appears in the pertinent packet, every time one byte moving process is 

40 performed from this position, the value be of the byte counter is incremented by one (Step S26), and it is judged whether 
or not the movement for one audio frame is completed, namely, it is judged whether or not be = bpaf (Step S27). Here, 
if it is not be = bpaf (Step S27 ; NO), since the reading position is still located in the same audio frame, the operation 
returns back to the step S26. Further, the one byte moving process is performed from this position, and the value be 
of the byte counter is incremented. On the other hand, if be = bpaf (Step S27 ; YES), since this means that the movement 

45 for one audio frame is just completed, the value afc of the frame counter is incremented by one, while the value be of 
the byte counter is reset (Step S28). 

After that, it is judged whether or not afc = n (Step S29) by using the count value n for judging the access completion 
obtained at the step S21 or S22. If it is not afc = n (Step S29 ; NO), the operation returns back to the step S26. Further, 
the one byte moving process is performed from this position, and the value be of the byte counter is incremented. On 

50 the other hand, if afc = n (Step S29 ; YES), since this means that the access to the audio frame of the access target 
in the GOF of the access target is completed, the pertinent access operation is ended. 

As mentioned above, according to the first embodiment of the reproducing apparatus S2, the access to the par- 
ticular audio frame in the particular GOF can be performed easily and quickly, by using a relatively small byte counter 
and frame counter, on the basis of the number naf of the frame boundaries described in the audio frame information 

55 at each packet and the audio frame number afn described in the audio data information at each packet as shown in 
FIG. 15. 

Next, the construction and the operation of the second embodiment of a reproducing apparatus S2 in accordance 
with the present invention are explained with reference to FIG. 22. The second embodiment of this reproducing appa- 
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ratus S2 is constructed to perform an appropriate access operation for the second embodiment of the DVD 1 constructed 
to have the audio frame information and the audio data information shown in FIG. 16. 

In this embodiment, in FIG. 18, the PCM linear audio data shown in FIG. 4 stored in the audio pack 43 is outputted 
to the audio buffer 92, as the audio signal Sad, by the demultiplexer 86. Along with this, since the PTS is always added 

s to each packet, the information indicating the GOF number in particular, in addition to the first access pointer, the frame 
number and the total number of the frame boundaries which are shown in FIG. 16, are outputted, as the portion of the 
control signal Sdmx, for each packet, from the demultiplexer 86. These informations are adapted to be inputted to the 
system controller 100. Alternatively, these information may be inputted through the audio buffer 92 to the system con- 
troller 100, and not as the portion of the control signal Sdmx. 

10 The system controller 100 is constructed to perform the access operation shown in the flow chart of FIG. 22, on 

the basis of these inputted informations. 

A control operation performed by the system controller 100 is explained hereafter. In FIG. 22, the same steps as 
those in FIG. 19 carry the same step numbers and the explanations thereof are omitted. 

As shown in FIG. 22, in the access operation in this embodiment, it is not required to perform the processes 

is indicated at the steps S12, S14, S15 and S16 in FIG. 19. There are steps S37 and S39, instead of the steps S17 and 
S1 9. In these steps S37 and S39, it is judged whether or not the GOF number gn obtained by reading from the audio 
data information in the pertinent audio pack (any one of 0 to 14) is as gn = 1 and gn = 2, respectively. This reason is 
explained below That is, the processes indicated at the steps S12, S14, S15, S16, S17 and S19 in FIG. 19 are those 
of using the counter for counting the GOF number, and, while incrementing the count value, obtaining the audio packs 

20 in which the GOF numbers in the packets thereof become 1 and 2, respectively. Namely, in this embodiment, since 
the information indicating this GOF number is described in the audio data information at each packet as shown in FIG. 
1 6, it is possible to read this information to thereby detect the GOF number at the packet. Further, it is enough to search 
the audio packs in which these read GOF numbers become 1 and 2 respectively. For the processes other than the 
above mentioned processes, the access operation in this embodiment shown in FIG. 22 is identical to the case shown 

25 in FIG. 19. 

As mentioned above, according to the second embodiment of the reproducing apparatus S2, the access to the 
particular audio frame in the particular GOF can be performed more easily and quickly, by using a relatively small byte 
counter and frame counter, on the basis of the number naf of the frame boundaries described in the audio frame 
information at each packet and the audio frame number afn described in the audio data information at each same 

30 packet, as well as the GOF number gn described in the audio data information at each packet, as shown in FIG. 16. 

Next, in order to consider a remarkable effect of the embodiments shown in FIG. 1 9 and FIG. 22, the operation of 
the reproducing apparatus S2 when accessing the particular audio frame in the particular GOF, for the DVD in which 
the informations indicating the first access pointer, the frame number, the total number of the frame boundaries and 
the GOF number shown in FIG. 15 or FIG. 16 are not described, by using the audio frame information and the audio 

35 data information other than the above mentioned informations, is explained with reference to a flow chart in FIG. 23, 
as a comparison example. In FIG. 23, the same steps as those in FIG. 19 carry the same step numbers, and the 
explanations thereof are omitted. 

In FIG. 23, after the process at the step S13, the operation jumps to the step S23. After this step S23, the values 
of the byte counter (be), the audio frame counter (afc) and the GOF counter (gc) are reset to [0] (Step S41 ). After that, 

40 the processes at the steps S26, S27 and S28 are performed for the audio sample data in the current audio pack. 

After that, it is judged whether or not it is the GOF (#2) immediately before the GOF (#3) of the access target, 
namely, whether or not it is gc = 2 (Step S42). Here, if it is gc = 2 (Step S42 ; YES), it is further judged whether or not 
it is the audio frame (#1 3) immediately before the audio frame (#1 4) of the access target, namely, whether or not it is 
afc = 1 3 (Step S43). If it is afc = 1 3 (Step S43 ; YES), the access to the target audio frame in the target GOF is ended. 

45 |f any condition of the steps S42 and S43 is not satisfied (Step S42 ; NO or Step S43 NO), the operation proceeds to 
a step S44. Then, it is judged whether or not it is the last audio frame of the GOF, namely, whether or not it is afc = 20 
(Step S44). If it is not afc = 20 (Step S44 ; NO), the operation returns back to the step S26, and the processes on and 
after that step are repeated. On the other hand, if it is afc = 20 at the step S44 (Step S44 ; YES), the GOF counter (gc) 
is incremented by one, and the audio frame counter (afc) is reset (Step S45). After that, the operation returns back to 

50 the step S26, and the processes on and after that step are repeated. 

As mentioned above, according to the comparison example indicated in FIG. 23, when the time specification (a 
o'clock : b minute : c second : d GOF : e audio frame) is specified by the user, in order to access the audio frame of 
the GOF specified by the above time specification, the size (i.e. the byte number) of the audio frame required for the 
access algorithm is obtained, by referring to the sampling frequency, the number of quantized bits and the number of 

55 channels that are described in the audio data information of the audio pack, after accessing the unit of the navi-pack. 
Then, the byte count is performed from the GOF immediately after the navi-pack. When this byte number reaches the 
size (L e. the byte number) of the audio frame, the audio frame counter is incremented. Further, when this frame number 
reaches 20, the GOF counter is incremented. 
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In this way, according to the above mentioned comparison example, there is such a defect that the reproducing 
apparatus must always perform the byte counting operation during the access operation. In contrast with this, according 
to the first and second embodiments of the reproducing apparatus S2 in accordance with the present invention, there 
is such an advantage that it is enough to perform the byte counting operation only in the packet which includes the 

5 audio frame of the access target, as indicated in FIG. 1 9 and FIG. 22. 

Moreover, according to the above mentioned comparison example, it has such a defect that, when the data cannot 
be read since a dropout equal to or more than a length of an error correction occurs elsewhere between the navi-packs, 
the current position cannot be specified until the next navi-pack comes, resulting in that the access is impossible. In 
contrast with this, according to the first embodiment of the reproducing apparatus S2 of the present invention, even if 

10 the data cannot be read since the dropout equal to or more than the length of the error correction occurs elsewhere 
between the navi-packs, the audio frame number in which the boundary firstly appears for each packet is described 
as shown in FIG. 15, so that the current position after the dropout can be specified easily by referring to this audio 
frame number in each packet 202 as shown in FIG. 24. Further, according to the second embodiment of the reproducing 
apparatus S2 of the present invention, in addition to the merit of the first embodiment, even if such a large dropout as 

is striding over a plurality of GOFs between the navi-packs occurs, since the GOF number is described for each packet 
as shown in FIG. 16, the current position after the dropout can be specified easily by referring to the audio frame 
number and the GOF number in each packet 202 as shown in FIG. 25. This is advantageous over the first embodiment 
in which the variation of the GOF is detected by comparing the audio frame number read from the audio data information 
with the current audio frame number, and the above mentioned large dropout cannot be dealt with. 

20 in the comparison example, it may be contemplated to refer to the PTS described in the packet header to thereby 

treat the dropout. In this case, as indicated in FIG. 26, even if one packet 202 is dropped out, assuming that the PTS 
in a first packet is PTS 1 , if a PTS 2 of the following packet can be detected, the position of the first packet 202 can 
be detected or calculated from an equation of (PTS1 - PTS2) / 90000, and thereby the access on and after it is possible. 
However, addition and subtraction of 33 bits must be performed in order to implement this dropout treatment. Thus, 

25 an algorithm for positioning is complex, and that it is very difficult to realize this dropout treatment especially by use of 
a 4-bit or 8-bit microcomputer as a controller. In contrast with this, according to the first and second embodiments of 
the reproducing apparatus S2 of the present invention, since the audio frame number in which, the boundary firstly 
appears in each packet and the GOF number thereof are described, the first and second embodiments are very ad- 
vantageous in that the current position after the dropout can be specified easily by using the simple algorithm of referring 

30 to the audio frame number in each packet and the GOF number thereof. 

As explained above, it is understood that the first and second embodiments of the reproducing apparatus S2 of 
the present invention are very excellent, by comparing them with the comparison examples in FIG. 23 and FIG. 26. 



35 Claims 

1. An information record medium (1 : DVD) having a record track (1 a) to be reproduced by an information reproducing 
apparatus (S2), which comprises a read fleans (80), reproduces audio information while relatively moving said 
read means along said record track recorded with at least said audio information by a predetermined unit of audio 

<o frame and by a unit of frame group (GOF) consisting of consecutive audio frames in a predetermined number, and 

is able to access a particular audio frame in a particular frame group, characterized in that: 

a plurality of audio packets (43, 202) are arranged along said record track, in each of which audio information 
pieces (207) constructing said audio information sampled by a predetermined sampling frequency and audio 
45 control information (204, 205, 206) for controlling a reproduction and an access of said audio information 

pieces by said information reproducing apparatus are respectively recorded; and 

said audio control information comprises frame boundary number information (205) indicating the number of 
boundaries of audio frames existing in said audio packet including said audio control information, and frame 
number information (206, 206a) indicating a serial frame number of said audio frame at which the boundary 
50 firstly appears in said audio packet including said audio control information. 

2. An information record medium (1 : DVD) according to claim 1 , characterized in that said audio control information 
(204, 205, 206) further comprises frame group number information (208) indicating a serial group number of said 
frame group (GOF) including said audio frame at which the boundary firstly appears. 

55 

3. An information record medium (1: DVD) according to claim 1 or 2, characterized in that said audio control infor- 
mation (204, 205, 206) further comprises information (206, 206a) indicating said sampling frequency, the number 
of quantized bits of said audio information and the number of channels of said audio information, respectively. 
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An information record medium (1 : DVD) according to any one of claims 1 to 3, having said record track (1a) to be 
reproduced by said information reproducing apparatus (S2), which reproduces video information by a predeter- 
mined video frame unit in addition to said audio information while relatively moving said read means (80) along 
said record track recorded with at least said video information by said video frame unit in addition to said audio 
information, characterized in that 

a plurality of video packets (42, 44) are multiplexed with said audio packets (43) and arranged along said 
record track, in each of which video information pieces constructing said video information and video control in- 
formation for controlling a reproduction of said video information pieces by said information reproducing apparatus 
are respectively recorded. 

An information recording apparatus (S1) for recording information onto an information record medium (1: DVD) 
having a record track (1a) to be reproduced by an information reproducing apparatus (S2), which comprises a 
read means (80), reproduces audio information while relatively moving said read means along said record track 
recorded with at least said audio information by a predetermined unit of audio frame and by a unit of frame group 
(GOF) consisting of consecutive audio frames in a predetermined number, and is able to access a particular audio 
frame in a particular frame group, characterized in that said information recording apparatus comprises: 

a record means (72 to 78) for respectively recording, into a plurality of audio packets (43, 202) arranged along 
said record track, audio information pieces (207) constructing said audio information sampled by a predeter- 
mined sampling frequency and audio control information (204, 205, 206) for controlling a reproduction and an 
access of said audio information pieces by said information reproducing apparatus, said audio control infor- 
mation comprising frame boundary number information (205) indicating the number of boundaries of audio 
frames existing in said audio packet including said audio control information, and frame number information 
(206) indicating a serial frame number of said audio frame at which the boundary firstly appears in said audio 
packet including said audio control information; and 

an input means (ST, 71) for inputting at least one portion of said audio control information. 

An information recording apparatus (S1) according to claim 5, characterized in that said record means (72 to 78) 
records said audio control information (204, 205, 206), which further comprises frame group number information 
(208) indicating a serial group number of said frame group (GOF) including said audio frame at which the boundary 
firstly appears. 

An information reproducing apparatus (S2) for reproducing information from an information record medium (1: 
DVD) having a record track (1a) recorded with at least audio information by a predetermined unit of audio frame 
and by a unit of frame group (GOF) consisting of consecutive audio frames in a predetermined number, wherein: 
a plurality of audio packets (43, 202) are arranged along said record track, in each of which audio information 
pieces (207) constructing said audio information sampled by a predetermined sampling frequency and audio control 
information (204, 205, 206) for controlling a reproduction and an access of said audio information pieces are 
respectively recorded; and said audio control information comprises frame boundary number information (205) 
indicating the number of boundaries of audio frames existing in said audio packet including said audio control 
information, and frame number information (206) indicating a serial frame number of said audio frame at which 
the boundary firstly appears in said audio packet including said audio control information, characterized in that 
said information reproducing apparatus comprises: 

a read means (80) for reading information recorded at a predetermined reading position on said record track; 

a movement means (101 , 102, 103) for relatively moving said read means along said record track or across 
said record track; 

an extract means (86) for extracting said audio packet from the information read by said read means; 

an audio decode means (92, 93) for decoding said audio information included in said extracted audio packet 
in accordance with said audio control information; 

a specification means (98) for specifying the particular audio frame in the particular frame group; and 

a control means (100) for controlling said read means, said movement means and said extract means to 
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access the specified particular audio frame in the particular frame group, on the basis of said frame boundary 
number information and said frame number information, which are included in said audio control information 
in said extracted audio packet. 

5 8. An information reproducing apparatus (S2) according to claim 7, characterized in that said control means (100) 
comprises a counter means for counting information amount in said audio packet, and controls said record means 
(80), said movement means (101 , 102, 103) and said extract means (86) to successively approach said specified 
audio frame in said frame group (GOF) while counting the information amount in said accessed audio packet by 
said counter means after accessing said audio packet including said specified audio frame in said frame group, 

10 on the basis of said frame boundary number information (205) and said frame number information (206). 

9. An information reproducing apparatus (S2) according to claim 7, characterized in that: 

said audio control information (204, 205, 206) further comprises frame group number information (208) indi- 
75 eating a serial group number of said frame group (GOF) including said audio frame at which the boundary 

firstly appears; and 

said control means (100) controls said read means (80), said movement means (101, 102, 103) and said 
extract means (86) to access said specified audio frame in said audio group, further on the basis of said frame 
group number information. 

20 

10. An information reproducing apparatus (S2) according to claim 9, characterized in that said control means (100) 
comprises a counter means for counting information amount in said audio packet (43), and controls said record 
means (80), said movement means (101, 102, 103) and said extract means (86) to successively approach said 
specified audio frame in said frame group (GOF) while counting the information amount in said accessed audio 

25 packet by said counter means after accessing said audio packet including said specified audio frame in said frame 

group, on the basis of said frame boundary number information (205), said frame number information (206) and 
said frame group number information (208). 
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